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(57) Abstract 



The invention provides a recombinant virus vector for use as an immunotherapeutic or vaccine. The recombinant virus vec- 
tor comprises at least one pair of nucleotide sequences heterologous to the virus and which have sufficient sequence homology 
that recombination between them might be expected. The pair of nucleotide sequences are arranged in the virus vector such that 
they are inverted with respect to each other. The virus vector is able to infect a mammalian host cell and express as polypeptide 
the heterologous nucleotide sequences in the host cell. For infection thought to be caused by HPV infection, the pair of nucleotide 
sequences encode part or all of human papillomavirus (HPV) wild-type proteins or mutant proteins immunologically cross-reac- 
tive therewith. For an immunotherapeutic or vaccine against cervical cancer, the recombinant virus vector encodes part or all of 
the HPV wild-type proteins HPV16E7 and HPV18E7 or mutant proteins immunologically cross- reactive therewith. 
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RECOMBINANT VIRUS VECTORS ENCODING HUMAN PAPILLOMAVIRUS PROTEINS 
This invention relates to recombinant virus vectors. 
In particular, it relates to recombinant virus vectors 
designed to overcome the problem of recombination 



relates to recombinant virus vectors encoding human 
papillomavirus proteins; to immunotherapeutics and 
vaccines for conditions associated with HPV infection; 
to the production of a virus (e.g. vaccinia virus) 

10 engineered to express antigens encoded by human 
papillomavirus types 16 and 18 and to 
immunotherapeutics and vaccines for cervical cancer. 

In recent years, strong evidence has been adduced 
for a link between cervical carcinoma and infection 

15 with certain types of human papillomavirus (HPV), 
particularly with types 16, 18, 31, 33 and 35 (Gissman 
et al.. Cancer Cells 5,275, 1987). This is based on 
hybridisation studies which have indicated that more 
than 85-90% of biopsies from cervical tumours can be 

20 shown to contain papillomavirus DNA. HPV16 DNA is 
most commonly found ( in about 60% of tumours ) with 
HPV18 the next most frequent (about 20%) and the other 
types accounting for a further 5-10%. In many 
instances, tumour cells from the biopsies do not 

25 however, contain the complete genome, but rather a 
deleted form. The extent and location of the deleted 
information within the virus genome is variable, but 
a general feature is the retention of the part of the 



5 between homologous nucleotide sequences. 



It also 
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genome encoding -the E7 protein ( Schwarz et al • , Na1:ure 



E6-encoding region is usually present: • The ubiquitous 
presence of the E7-encoding region in txanour cells 
5 suggests that the protein product of this gene might 
play a role in the induction or maintenance of the 
transformed phenotype. Indeed in most cell lines 
established from tumour biopsies, expression of the E7 
gene can be detected (Smotkin & Wettstein, PNAS, 83, 
10 4680, 1986). Furthermore, it has been shown that the 
E7 gene product can bind to the retinoblastoma (Rb) 
gene product, a recognised " ant i -oncogene" in normal 
human cells (Hunger et alo, EMBO J. 8,4099, 1989). 
This strengthens the belief that E7 is directly 
15 involved in cell transformation. 

The presence and expression of the E7 and E6 
genes in tumour cells derived from cervical carcinoma 
biopsies, suggests the possibility that these proteins 
could be potential targets for the immunological 
20 recognition of the tumour cells. It is well known 
that viral proteins produced inside mammalian cells 
can be processed through a host cell pathway to short 
peptides, which then form a complex with host Major 
Histocompatibility Complex (MHC) Class 1 molecules and 
25 are transported to the cell surface. These complexes 
may then present a target for recognition by the host 
immune system. Interaction of the complex with the 
receptor molecule on the surface of cytotoxic T cells 



314, 



111, 



1985). 



In addition. 



the ad j acent 
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(the T cell receptor) can then lead to activation of 
the T cells to proliferate or to destroy the 
recognised cell. It is possible, therefore, that the 
presence in the body of a population of cytotoxic T 
5 lymphocytes (CTLs) which are capable of recognising 
cells expressing the HPV E6 and/or E7 proteins could 
afford protection against the development and 
proliferation of cervical tumours. Indeed it has been 
reported that normally oncogenic mouse cells 
10 engineered to express the HPV E7 protein are unable to 
form tumours in mice which have been previously 
immunised with non- tumor igenic E7-expressing cells, 
and that this rejection is mediated by CD8+ 
lymphocytes (CTLs) (Chen et al., PNAS 88, 110, 1991). 
15 Further, the generation of an active population of 
such cells subsequent to tumour initiation could 
result in regression of the tumour. 

There are numerous reports on the construction of 
recombinant viruses eg vaccinia viruses containing, 
20 and expressing foreign genes (Mackett & Smith, J. gen. 
Virol. 67,2067, 1986), and several reports of the use 
of these recombinant viruses to generate effective 
immune responses against the expressed foreign 
antigens. A particular advantage of this route for 
25 delivery of antigens for vaccination is that it may 
lead to the development of cellular as well as humoral 
immunity. This is because the foreign proteins will be 
produced inside cells of the infected individual in a 
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manner similar to that which occurs during natural 
infection • This means that they should be processed 
through the correct pathway to allow generation of a 
CTL response • In several cases , it has been 
5 demonstrated directly that immunisation with the 
recombinant virus is capable of producing a cellular 
immune response in the form of foreign 
antigen-specific CTLs ( Moss & Flexner , Ann • Rev o 
Immunol - , 5 , 305 , 1987 ) • Furthermore , vaccination of 
10 animals with recombinant vaccinia viruses expressing 
certain tumour-specific antigens, such as the human 
melanoma-associated antigen P97 (Est in et al., PMAS, 
85, 1052, 1988), the bovine papillomavirus E7 protein 
(Meneguzzi et al,. Vaccine, 8, 199, 1990) and the 
15 human breast cancer-associated antigen ETA (Hareuveni 
et alo, PNAS, 87, 9498, 1990) has been demonstrated to 
result in the induction of immunity against tumour 
initiation and progression. 

The present applicants have recognised the 
20 desirability of producing a recombinant virus vector 
which is useful as an immunotherapeutic or vaccine for 
conditions caused by HPV infection, for example for 
cervical cancer. With respect to cervical cancer, the 
art at the time of the applicants making the present 
25 invention recognised the E7 gene as having the 
potential to immortalise cells . Therefore, it would 
be felt inappropriate to incorporate the E7 gene in an 
immunotherapeutic . The applicants however , have 
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recognized that the beneficial effects to be gained by 
treatment with an immunotherapeutic comprising the E7 
5 gene are likely to outweigh by far any risk associated 
with the oncogenic activity of the E7 gene. Thus, an 
aspect of the applicants invention involves the use of 
a recombinant virus vector which expresses an E7 gene, 
as an immunotherapeutic or vaccine. Furthermore, the 

10 applicants provide embodiments of their invention in 
which these risks are reduced still further by 
specific alteration of the gene sequences in order to 
reduce the oncogenic potential of the E7 gene without 
compromising its ability to stimulate an appropriate 

15 immune response. 

The present applicants have also recognized that 
where a number of HPV proteins which may be encoded by 
different HPV strains are implicated as being 
associated with a particular HPV-associated condition 

20 (for example, cervical carcinoma, HPV16 nd HPV18; 
genital warts, condyloma acuminata, respiratory 
papillomatosis, HPV6 and HPVll; squamous cell 
carcinoma in immunosuppressed individuals, HPV5 and 
HPV8 ) , rather than produce a plurality of recombinant 

25 viruses engineered separately to express each of the 
implicated proteins, it would be advantageous to 
produce a single virus recombinant which is able to 
express part or all of the sequences of more than one 



E7 gene in an immunotherapeutic. 



They have also 
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of the proteins • Thus, with respect to cervical 
cancer, rather than produce four recombinant viruses 
engineered separately to express each of the potential 
targets for immunological recognition of cervical 
5 tumour cells i.e. the HPV16 E6, HPV16 E7, HPV18 E6 and 
HPV18 E7 proteins, it would be especially advantageous 
to produce a single virus recombinant which is able to 
express the part or all of the sequences of more than 
one of the proteins, preferably at least two of the 
10 proteins and most preferably all four proteins. That 
the present applicants are able to achieve this is 
particularly surprising • This is because, the coding 
sequences for many HPV proteins are highly homologous 
to other equivalent HPV proteins ( for example from 
15 other virus strains). Thus, the HPV16 E6 and HPV18 E6 
proteins show overall homology of 62% and comprise 
regions of very high homology. The same is true for 
HPV16 E7 and HPV18 E7 which show overall homology of 
57%, with particular regions of very high homology. 
20 This means that one would expect recombination to 
create problems such as loss of gene sequences . The 
applicants have, however, devised a novel strategy 
designed to minimise the likelihood of such 
recombination events and to circumvent the deleterious 
25 effect of those events should they indeed arise. 
Thus, surprisingly, the invention provides recombinant 
virus vectors which comprise at least one pair of 
nucleotide sequences which have sufficient sequence 
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homology that recombination between them might be 



sequences may encode part or all of human 
papillomavirus (HPV) wild- type proteins or mutant 
5 proteins immunologically cross-reactive therewith. In 
particular, the invention provides a recombinant 
vector which can maintain stably, and express, part or 
all of four of the desired gene sequences from HPV16 
and HPV18. 

10 Thus, the present invention provides a 

recombinant virus vector for use as an 
immunotherapeutic or vaccine which comprises at least 
one pair of nucleotide sequences heterologous to said 
viarus and which have sufficient sequence homology that 

15 recombination between them might be expected wherein 
said pair of nucleotide sequences are arranged in said 
virus vector such that they are inverted with respect 
to each other to reduce the likelihood of 
recombination events leading to loss of part or all of 

20 said sequence and said virus vector is able to infect 
a mammalian host cell and express as polypeptide the 
heterologous nucleotide sequences in said host cell. 
The at least one pair of nucleotide sequences may 
encode part or all of human papillomavirus (HPV) wild- 

25 type proteins or mutant proteins immunologically 
cross-reactive therewith. The pair of nucleotide 
sequences may encode parr or all of the protein E7 
from both HPV16 and HPV18 or functional equivalents 



expected . 



The at least one pair of nucleotide 
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thereof. The pair of nucleotride sequences may encode 
part or all of the proteins E6 from both HPV16 and 
HPV18 or functional equivalents thereof - 

The recombinant virus vector may comprise a 
5 further pair of nucleotide sequences heterologous to 
said virus and which ( i ) have sufficient sequence 
homology that recombination between them might be 
expected wherein said further pair of nucleotide 
sequences are arranged in said virus vector such that 
10 they are inverted with respect to each other and said 
virus vector is able to infect a mammalian host cell 
and express as polypeptide the further pair of 
heterologous nucleotide sequences in said host cell. 
The further pair of nucleotide sequences may 
15 encode part or all of HPV wild- type proteins or mutant 
proteins immunologically cross-reactive therewith. 

For example, the present invention also provides 
a recombinant virus vector which in addition to the E7 
coding . sequences , also comprises and is adapted to 
20 express genetic sequences encoding part or all of the 
protein E6 from both HPV16 and HP VI 8 or functional 
equivalents thereof. The genetic sequences may 
comprise sequences encoding HPV16 E6/E7 and HPV18 
E6/E7 as shown in figures 1(a) and 1(b) 
2 5 respectively . 

The genetic sequences may encode an antigenic 
moiety of the said proteins. 

Either or both of the nucleotide sequences in a 
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pair of nucleotide sequences may be altered to make 
them less homologous than an equivalent pair of 
nucleotide sequences encoding wild-type HPV proteins. 
The alteration in nucleotide sequence may be in an 



alteration in nucleotide sequence will not result in 
an alteration of the encoded amino acid sequence. 

Two or more nucleotide sequences each encoding 
separate proteins may be fused together to form a 
10 single open reading frame. Thus the genetic sequences 
encoding part or all of the proteins E6 and E7 from 
HPV16, may be fused together to form a single open 
reading frame. The genetic sequences encoding part or 
all of the proteins E6 and E7 from HPV18, may be fused 

15 together to form a single open reading frame. The 
genetic sequences encoding part or all of the proteins 
E6 and E7 from both HPV16 and HPV18, may be fused 
together to form a single open reading frame. Thus, 
the recombinant virus vector may have the pairs of 

20 nucleotide sequences arranged according to any one of 
the options shown in Figure 26. Where the recombinant 
virus vector comprises an open reading frame having a 
fused genetic sequence encoding part or all of the 
proteins E6 and E7 from HPV16, and a separate open 

25 reading frame having fused genetic sequences encoding 
part or all of the proteins E6 and E7 from HPV18, the 
two open reading frames may be inverted with respect 
to one another. For example, the two open reading 



5 area of high sequence homology. 



Preferably, the 
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frames may be arranged ±n the recombinant virus vector 
adjacent to each other. The inversion may be such 
that the E6 coding sequences of HPV16 and HPV18 are 
both located between the E7 coding sequences of HPV16 
5 and HPV18. Alternatively, the inversion could be such 
that the E7 coding sequences of HP VI 6 and HP VI 8 are 
both located between the E6 coding sequences of HPV16 
and HPV18. In particular the two open reading frames, 
each with its respective promoter, may be arranged 
10 next to each other in the recombinant vector. In this 
case the promoters may be located between the genes, 
which are transcribed outwardly, or the promoters may 
be located outside the genes, which are transcribed 
inwardly. 

15 Similarly, the genetic sequences encoding part or 

all of the E7 protein from HPV16 and the E7 protein 
from HP VI 8 may be fused together to form a single open 
reading frame. The genetic sequences encoding part or 
all of the E6 protein from HPV16 and the E6 protein 
20 from HPV18 may be fused together to form a single open 
reading frame. This leads to another range of 
arrangements similar to those shown in Figure 26. The 
fusions may be via a single codon encoding a 
relatively small neutral amino acid e.g. glycine. 
25 Thus the present invention also provides a 

recombinant virus vector which comprises a first open 
reading frame having a fused genetic sequence encoding 
part or all of the wild-type proteins E6 and E7 from 
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HPV16; and a separate second open reading frame having 



a fused genetic sequence encoding part or all of the 
wild-type proteins E6 and E7 from HPV18; wherein the 
first and second open reading frames may be inverted 
5 with respect to one another whereby either: i) the E6 
coding sequences of HPV16 and HPV18 are both located 
between the E7 coding sequences of HPV16 and HPV18; or 
ii) the E7 coding sequences of HPV16 and HPVIB are 
both located between the E6 coding sequences of HPV16 
10 and HPV18; and wherein any of said wild-type proteins 
may be replaced by a mutant protein immunologically 
cross-reactive therewith, 

Each of the first and second open reading frames 
may have a corresponding promoter and the two open 
15 reading frames each with its promoter, are arranged 
next to each other in the virus. 

The present invention also provides a recombinant 
virus vector wherein either: i) the promoters are 
located between the first and second reading frames 
20 whereby the open reading frames are transcribed 
outwardly; or ii ) the promoters are located outside 
the first and second open reading frames whereby the 
open reading frames are transcribed inwardly. 



25 virus vector which comprises a first open reading 
frame having a fused genetic sequence encoding part or 
all of the wild-type proteins E6 and E7 from HPV16; 
and a separate second open reading frame having a 



The present invention also provides a recombinant 
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fused genetic sequence encoding part: or all of trhe 
wild-type proteins E6 and E7 from HPV18; wherein the 
E6 coding sequences of HPV16 and HPV18 are both 
located between the E7 coding sequences of HPV16 and 
5 HPV18; and each open reading frame has a corresponding 
promoter, the promoters being located between the 
first and second open reading frames whereby the open 
reading frames are transcribed outwardly; and wherein 
any of said wild- type proteins may be replaced by a 
10 mutant protein immunologically cross -reactive 
therewith . 

The wild-type proteins HPV16E7 and HPV18E7 may be 
replaced with mutant proteins which are substantially 
homologous to said wild-type proteins and in which the 
15 residues cys 24 and glu 26 of wild-type protein 
HPV16E7 and the residues cys 27 and glu 29 of wild- 
type protein HPV18E7 are replaced with glycine 
residues . 



20 from vaccinia virus. 

The applicants have also recognized that for 
effective function as an immunotherapeutic, it is 
desirable for the recombinant virus to retain its 
ability to replicate and thereby generate an active 

25 infection in order that a cellular immune response may 
be mounted against the virus-encoded proteins. Thus, 
the applicants propose that the foreign gene sequences 
should be inserted into the vector virus at sites, the 



The recombinant virus vector may be derivable 
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disruption of which by the insertion of the 
heterologous gene sequences will not substantially 
interfere with, and therefore have a substantially 
adverse affect on any viral functions which relate to 
5 the replicative ability of the virus in the infected 
host animal. The applicants have named these sites 
' neutral sites ' ( although the term • neutral ' should 
not be interpreted strictly as it is acknowledged that 
the disruption of these sites may have a small, but 
10 relatively speaking inconsequential adverse affect on 
replicative ability). 

DNA sequences which affect virus replication can 
fall into several categories: 
i) protein coding sequences; 

15 ii ) elements involved in control of gene expression; 
and iii ) elements involved in virus DNA replication 
A non-essential and neutral insertion site must 
therefore avoid such regions, and, such sites have 
been identified on 
20 the basis of nucleotide sequencing studies. Thus the 
genetic sequences may be inserted into neutral sites 
within the virus genome. One or more genetic 
sequences may be inserted into the same neutral site. 

Neutral sites can be easily tested for according 
25 to techniques well known in the art. For example, a 
site may be selected, interrupted or deleted using 
standard methodologies and the resultant recombinant 
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virus placed in conditions which normally support 
growth of the wild type virus vector, to assess the 
effect of the manipulations- The pathogenicity of the 
virus may be further compared with that of the 
5 unmodified virus vector strain in animal models, in 
order to assess its level of attenuation. 

In the present invention, the virus vector may be 
vaccinia virus. The vaccinia virus may be attenuated 
or disabled so that it is unable to fully replicate 
10 and esablish an extensive infection of host cells. 

Vaccinia virus has been used extensively in the 
past for vaccination against smallpox , and its use 
worldwide has led to the complete eradication of the 
disease (Bhebehami, Microbiol. Rev., 47, 455, 1983). 
15 During the World Health Organisation (WHO) campaign to 
eradicate smallpox, several different strains of 
vaccinia virus were used as vaccines. In 1984 a 
meeting was sponsored by the WHO to discuss the use of 
vaccinia virus as live virus vectors (Bulletin of the 
20 WHO 63(3): 471-477). The data in this report indicates 
that the number of complications associated with 
vaccination was lowest for the Wyeth strain of 
vaccinia virus, and so this strain has been chosen as 
a basis for the construction of the recombinant virus 
25 according to an embodiment of the present invention. 

It is well known that insertion of foreign DNA 
into the genome of vaccinia virus at certain favoured 
sites, such as the thymidine kinase gene locus, can 
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reduce dramat:±cally the ability of the virus to 
replicate in vivo» As discussed above, the aim of the 
therapeutic approach described here is to generate an 
active in vivo infection, so that a cellular immune 
5 response may be mounted against the virus encoded 
proteins. The present invention provides a method for 
inserting foreign genes at neutral sites within the 
genome of a virus, the disruption of which sites by 
the insertion will not interfere with and therefore 
10 have a substantially adverse affect on virus 
replication. 

Where the virus is vaccinia virus, the neutral 
site may be identified herein within the Wyeth strain 
of vaccinia virus on the basis of the related IsJR 

15 strain nucleotide sequence. Alternatively, where 
other vaccinia virus strains are used, sites 
equivalent to those sites identified above may be 
used. The neutral sites may be any as identified 
hereinafter as A,B,C and D, or a functional 

20 equivalent. 

For successful expression of foreign proteins by 
the recombinant virus vector the foreign genes must be 
placed under the control of a promoter sequence which 
is operable by the virus. Thus the recombinant virus 

25 vector may comprise a single promoter which controls 
the expression of all the heterologous genetic 
sequences within a single open reading frame. 
Alternatively, where the recombinant virus vector 
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encodes more than one open reading frame containing 
heterologous genetic sequences, the virus may comprise 
a first promoter which controls the expression of the 
genetic sequences from a first open reading frame, and 
5 one or more further promoters which control the 
expression of the genetic sequences from one or more 
further open reading frames. The promoter sequence 
may be virus- specific and several have been 
characterised so far (Davison & Moss, J, Mol. Biol., 
10 210, 749, 1989; Davison & Moss, J. Mol. Biol., 210, 
771, 1989), The single promoter and the first and one 
or more further promoters may be the p 7.5 promoter. 
There have been reports that the induction of foreign 
antigen-specific CTLs requires expression of the 
15 antigen early in the virus replication cycle (Coupar 
et al., Eur. J. Immunol., 16, 1479, 1986). Therefore, 
a recombinant virus as provided by the present 
invention may involve the use of the p7 . 5 promoter 
(Venkatesan et al.. Cell, 125, 805, 1981) and/or the 
20 H6 promoter (Rosel et al., J. Virol, 60, 436, 1988), 
both of which are active both early and late in 
infection. 

As mentioned earlier, it has been reported that 
the E7 gene on its own has the potential to 
25 immortalise cells (Phelps et al.. Cell 53, 539, 1988). 
In an embodiment of the present invention, the 
stratecfy for expression of the protein involves 
production of E7 as a fusion protein with E6, which is 
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unlikely to retain biological function. Embodiments of 
the invention provide for reducing this risk still 
further, by making changes within the E7 gene which 
are known to destroy its oncogenic capacity (Chesters 
5 et alo, J. Gen Virol. 71, 449. 1990). Thus in the 
recombinant virus vectors of the present invention, 
the genetic sequences encoding part or all of the E7 
proteins may be altered from the equivalent wild type 
sequences, in order to render the sequences, used in 
10 the recombinant virus vectors less oncogenic than 
their equivalent wild type sequences. 

The present invention also provides 
pharmaceuticals comprising recombinant virus vectors 
as herein defined. The pharmaceutical may be for use 
15 against a condition caused by HPV infection which 
comprises an immunotherapeutically effective amount of 
a recombinant virus vector. The pharmaceutical may be 
for use against cervical cancer. 

The pharmaceutical may be a vaccine to immunise 
20 against a condition caused by HPV infection which 
comprises an amount of recombinant virus vector as 
herein provided which when administered to a recipient 
can specifically activate cells of the immune system 
to HPV proteins. The vaccine may be for immunisation 
25 against cervical cancer. 

The pharmaceuticals may comprise one or more 
excipients. The present invention also provides 
methods of using the recombinant virus vectors as 

BNSDCX^D: <WO_jg21683aA1JL;» 



I ILL 



WO 92/16636 



o 



PCr/GB92/00424 



O 



18 



herein defined to make medicaments for* use as 
immunotherapeutics or vaccines against conditions 
thought to be caused by HPV infection • For eKample 
for the prophylaxis and treatment of cervical cancer. 



treating mammalian patients with recombinant virus 
vectors and pharmaceuticals as herein provided. 

The present invention also provides a method of 
determining a neutral site in a virus vector, the 
10 disruption of which by the insertion of heterologous 
gene sequences will not interfere with, and therefore, 
have a substantially adverse affect on viral function 
which relates to the replicative ability of the virus. 
The method for this determination comprises : ( a ) 
15 analysing a viral genome to identify open reading 
frames which are likely to encode functional genes, 
by looking for expected codon usage between spaced 
apart start and stop codons; and (b) selecting sites 
which are not in such open reading frames, likely to 
20 encode functional genes, as identified in (a) This may 
include selecting sites between open reading frames 
for sequences of functional genes and selecting sites 
which are in open reading frames which have some 
functional gene characteristics, such as an expected 
25 codon usage, but have lost other essential 
characteristics such as a start codon. The method may 
also comprise interrupting or deleting the selected 
sites from the viral genome and placing the resultant 



5 



The present invention also provides methods of 
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virus in conditions which normally support: growth of 
the wild type viruFt. 

The present invention also provides neutral sites 
identified by use of the above methods. 



which shows a way of inducing a cellular Immune 
response against the papillomavirus proteins usually 
expressed in cervical tumour cells by the creation of 
a recombinant vaccinia virus, which has been 

10 engineered to produce the HPV E6 and E7 proteins, or 
proteins containing HPV E6 and E7 sequences, during 
its replication cycle. This therapeutic vaccinia 
virus contains the E6 and E7 genes from both HPV16 and 
HPV18, the viruses most commonly associated with 

15 cervical carcinoma. Vaccination with this single 
virus may thus stimulate immunity to the E6 and E7 
proteins of the HPV types associated with more than 
80% of cervical tumours. Expression of all four gene 
sequences (e.g. HPV16 E6 and E7; HPV18 E6 and E7 ) in 

20 a single virus however presents a problem, because of 
the likelihood of loss of genetic sequences through 
recombination. The present invention provides a 
method for circumventing this difficulty, firstly 
through specific sequence alteration, in order to 

25 reduce sequence homology and secondly through their 
insertion into the vaccinia virus genome in such a way 
that if such recombination were to occur, it would not 
lead to loss of sequences (i.e. in inverted 



The present invention provides an embodiment 
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orien-ta-tion with respecl: to each other ) . Expression 
of the desired four gene sequences in the vaccinia 
virus genome could also be difficult (though not 
impossible) to achieve as independent expression 
5 units ^ and so the invention provides that instead, the 
E6 and E7 open reading frames may be fused together. 
A problem with standard methods for insertion of 
foreign information into the vaccinia virus genome is 
that the use of selectable markers to increase the 
10 efficiency of recombination results in the ultimate 
presence in the recombinant virus also of the 
selectable marker gene itself* Methods for insertion 
have been developed however, which allow subsequent 
elimination of these extraneous sequences (Falkner & 
15 Moss J. Virol., 64, 3108, 1990) and these are used in 
an embodiment of the present invention to ensure that 
the final recombinant vaccinia virus has only those 
additional sequences which are necessary for its 
required function. 
20 In order that the present invention is more fully 

understood an embodiment will now be described in more 
detail with reference to the figures in which: 

Figure 1(a) shows the nucleotide sequence and 
three-frame translation of HPV16 E6/E7 polymerase 
25 chain reaction product (underlined regions indicate 
the E6 and E7 coding sequences); Figure 1(b) shows the 
nucleotide sequence and three- frame translation of 
HPV18 E6/E7 polymerase chain reaction product 
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(underlined regions indicatre the E6 and E7 coding 
sequences ) ; 

Figure 2 shows the cloning and modification of the 
HPV16 and HPV18 E6 and E7 genes; 
5 Figure 3 shows an open reading frame plot of vaccinia 
virus from positions 17201-18450 of the region covered 
by the four fragments SalF,G,H and I; short vertical 
lines denote termination codons, lines topped with 
boxes denote initiation codons, rectangles show 

10 relevant open reading frames, and arrows show 
direction of upper and lower DNA strands; 
Figure 4 shows an open reading frame plot of vaccinia 
virus from positions 21001-22000 of the region covered 
by the four fragments SalF,G,H and I; short vertical 

15 lines denote termination codons, lines topped with 
boxes denote initiation codons, rectangles show 
relevant open reading frames and arrows show direction 
of upper and lower DNA strands; 

Figure 5 shows an open reading frame plot of vaccinia 
20 virus from positions 23501-25000 of the region covered 
by the four fragments SalF,G,H and I; short vertical 
lines denote termination codons, lines topped with 
boxes denote initiations codons, rectangles show 
relevant open reading frames and arrows show the 
25 direction of upper and lower strands of DNA; 

Figure 6 shows a codon usage plot of vaccinia virus 
from positions 17201-18450 of the region covered by 
the four fragments SalF,G,H and I; arrows show 
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direction of each DNA strands- 
Figure 7 shows a codon usage plot of vaccinia virus 
from positions 21001-22000 of the region covered by 
the four fragments SalF,G,H and 1; arrows show 
5 direction of each DNA strand; 

Figure 8 shows a codon usage plot of vaccinia virus 
from positions 23501-25000 of the region covered by 
the four fragments SalF,G,H and I; arrows show 
direction of each DNA strand; 
10 Figure 9 shows the DNA sequence around site A showing 
translations in single letter amino acid code of genes 
SalF 17R and SalF 19R; 

Figure 10 shows the DNA sequence around site B showing 
translations in single letter amino acid code of genes 
15 SalF 20R and SalF20,5R; 

Figure 11 shows a comparison of the SalG2R open 
reading f rcune to the yeast guanylate kinase gene 
sequence ; 

Figure 12 shows the DNA sequence around site D showing 
20 translations in single letter amino acid code of genes 
HindB3R and Hind B4R; 

Figure 13 shows the cloning of vaccinia virus (Wyeth 
strain) neutral sites; 

Figure 14 shows the cloning of vaccinia virus promoter 
25 sequences ; 

Figure 15 shows the construction of vaccinia 
promoter- driven E6-7 cassette; 

Figure 16 shows the cloning of the E6-7 cassette into 
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vaccinia virus (Wyeth strain) neutral sites; 
Figure 17 is a diagram showing the recombination 
required to generate the final therapeutic vaccinia 
virus - HPV recombinant virus; 
5 Figure 18 shows the synthetic oligonucleotides used in 
the construction of the therapeutic vaccinia virus HPV 
recombinant ; 

Figure 19 shows the nucleotide sequence of vaccinia 
virus (WR strain) from positions 17201-18450 of the 
10 region covered by the four fragments SalF,G,H and I; 
Figure 20 shows the nucleotide sequence of vaccinia 
virus (WR strain) from positions 21001-22000 of the 
region covered by the four fragments SalF,G,H and I; 
and 

15 Figure 21 shows the nucleotide sequence of vaccinia 
virus (WR strain) from positions 23501-25000 of the 
region covered by the four fragments SalF,G,H and I. 
Figure 26 shows a variety of options for arrangment of 
HPV16E6 and E7 and HPV18E6 and E7 coding sequences in 

20 a recombinant virus vector. 

All cloning procedures are carried out according 
to the protocols described in "Molecular Cloning", A 
Laboratory Manual, eds. Sambrook, Fritsch and 
Maniatis, Cold Spring Harbor Laboratory Press, 1989. 

25 All plasmids on which site directed mutagenesis is 
performed are of the "phagemid" type, which may be 
converted to single-stranded DNA by superinfection 
with the bacteriophage fl. Preparation and site 
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directed mutagenesis of single-stranded DNA, is 
carried out as described by Brierley et al.. Cell, 57, 



oligonucleotides used are provided in figure 18. 

5 Preparation of the E6 and E7 genes from HPV16 and 
HPV18 for insertion into vaccinia virus 
Cloning of the HPV16 and HPV18 E6 and E7 
A fragment of DNA containing the HPV16 E6/7 coding 
region is prepared by polymerase chain reaction (PCR) 
10 amplification from the plasmid pBR322/HPV15 (Durst et 
alo, PNAS, 80, 3812, 1983) using the oligonucleotides 
SOS and SOS . A fragment containing the same region 
from HPV18 is prepared by the same procedure from 
plasmid pBR322/HPV18 (Boshart et al., EMBO J. 3,1151) 
1 5 using the oligonucleotides SOI and S02 • Plasmids 
PBR322/HPV16 and pBR322/HPV18 are both available from 
Behringwerke AG, P.O. Box 1140 D-3550 , Marburg, 
Germany (alternatively the necessary sequences can be 
created synthetically from the sequence information 
20 provided by the present application). 

In each case, this produces a DNA fragment of 
about 800 base pairs (bp) with a site for the 
restriction enzyme Nco 1 (CCATGG) located exactly at 
the beginning of the E6 gene, and a Smal site 
25 immediately downstream of the termination codon for 
the E7 gene (figure 1(a) and (b)). The products are 
then digested with Ncol and Smal, and cloned into 



537, 1989. 



The sequence of all the synthetic 
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Ncol-Smal digested plasmid pUCllSNS (a modified 
version of the "phagemld" pUCllS (Vlera & Messing, 
Methods Enzymol., 153,3, 1987) In which Ncol and Smal 
sites have been created by site-directed mutagenesis 
5 within the poly-llnker region) to generate the plasmid 
plMS7, containing the HPV16 sequences, and pIHSS 
containing the HPV18 sequences ( figure 2 ) . The use of 
pUCllB Is not crucial to the present strategy as any 
plasmid which can be manipulated by site directed 
10 mutagenesis can be successfully used. 

Fusion of the E6 and E7 ORFs 

For Insertion Into vaccinia virus, the E6 and E7 genes 
from each HPV type, are first fused together to form 
a single continuous ORF* This Is achieved by 
15 site-directed mutagenesis as follows: 

(I) The termination codon TAA of HPV16 E6 In pIMS7 Is 
altered using the oligonucleotide S20 to the sequence 
GGAA. This Is In order to convert the normally 
separate ORFs for HPV16 E6 and E7 Into a single ORF 

20 (PIMS7.1 - figure 2). 

(II) The termination codon TAA of HPV18 E6 In pIMSS is 
altered using the oligonucleotide S21 to the sequence 
GGAA. This Is in order to convert the normally 
separate ORFs for HPVIB E6 and E7 into a single ORF 

25 (pIMSS.l - figure 2). 

Abolition of the immortalising potential of E7 
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In order -to destroy the inunortalxsing properties of 
each of the E7 proteins, two key codons within the 
HPV16 E7 coding sequence, (cys24 and glu26 - figure 
1(a)) and the equivalent codons from HPV18 E7 (cys27 
5 and glu29 - figure 1(b)), are altered to glycine 
residues by site directed mutagenesis as follows • 

(i) The sequence of the E7 gene is altered in pIMS7 
to encode glycine at codons 24 and 26 (normally 
encoding cysteine and glutamate respectively, using 

10 oligonucleotide S22 (pIMS7«2 - figure 2). 

(ii) The sequence of the E7 gene is altered in pIMSS 
to encode glycine at codons 27 and 29 (normally 
encoding cysteine and glutamate respectively, using 
oligonucleotide S23 (pIMSSolB - figure 2). 

15 Reduction in intertypic recombination potential of 
HPV16 and HPV18 E6 and E7 sequence and elimination of 
potential vaccinia virus transcription termination 
signal 



A potential difficulty with the presence of both HPV16 
20 and HPV18 E6 and E7 specific DNA within the genome of 
a single virus, is that recombination between the two 
sets of related sequences could lead to loss or 
rearrangement of information such that expression of 
the required proteins is disrupted. The invention 
25 provides ways of minimising this risk. Firstly, by 
inserting the two sets of genetic information in the 
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vaccinia genome in opposite orientation to each other 
( so that recombination will result not in the loss of 
sequence information, but rather in its inversion). 
Secondly, by creating specific changes in the E6/7 
5 sequence of one of the HPV virus strains at sites 
where the homology is greatest. These changes however 
are made in such a way that the amino acid coding 
potential of the genes remains unaltered. 

The HPV18 E6 sequences is therefore altered by 
10 site*directed mutagenesis as follows: 

The sequence TTTTTATTCTAGAATTAGAG (which begins 
210 nucleotides from the start of E6 - underlined 
in figure 1(b)) is mutated, using oligonucleotide 
S24 to the sequence TTTCTACAGTAGAATCAGAG (pIMS8.2 
15 - figure 2) (changed nucleotides are in bold 

type) . 

A second aim of this change is to eliminate from 
the HPV18 E6 sequence, the sequence TTTTTAT, which is 
a potential termination signal for the early vaccinia 
20 virus transcription enzyme (Rohrmann et al. , Cell. , , 46, 
1029, 1986). 



Source and propagation of vaccinia virus 
The Wyeth strain of vaccinia virus is used for 
construction of the therapeutic virus. It is propagated 
25 in Vero cells for the purposes of genetic manipulation, 
and in the human diploid fibroblast cell line MRC5 for 
the production of the final therapeutic virus stock. 
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Both cell lines are obtained from the National Institute 
of Biological Standards and Control, South Mims, U.K.. 
The Wyeth strain of vaccinia virus, vero cells and the 
cell line MRC5 are also available from the American Type 
5 Culture Collection, 12301 Parklawn Drive, Rockville, 
Maryland 20852, USA. 



Identification and cloning of neutral sites from the 
vaccinia virus 
Description of neutral sites 
10 For the purpose of insertion of papillomavirus genes 
within the vaccinia virus genome, sites have been chosen 
to have two characteristics- 

Firstly, they should be non-essential regions, i.e. 
insertion of foreign genes at these points, will not 
15 disrupt any functions of the vaccinia virus to the 
extent that the virus can no longer grow in tissue 
culture . 

Secondly, they should be neutral sites, i.e. 
insertion of foreign genes all these points, will not 
20 increase or decrease the level of attenuation of the 
vaccinia virus. 

The difference between these two factors can be 
seen by looking at the thymidine kinase (TK) gene of 
vaccinia virus. It is a non-essential region, and hence 
25 viruses with genes inserted in the TK gene can grow well 
in tissue culture (Mackett at al., J. Virol., 49, 857, 
1984). However, such viruses have been found to be 
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greatly attenuated in vivo (Buller et al*. Nature 317, 



vaccination, such increased attenuation might be 
desirable* However, for an inununotherapeutic strategy 
5 where the danger from the disease to be treated clearly 
outweighs the risk of vaccine associated complications, 
use of an attenuated virus is considered undesirable 
since it could compromise the immunological response to 
the papillomavirus antigens. Hence, the applicants have 
10 identified sites which they judge will not attenuate the 
virus any further, and have termed them 'neutral sites' . 
Such sites have been identified within the virus genome 
by careful analysis of the DNA sequence of the mi 
strain. The mi strain was originally derived from the 
15 Wyeth strain by passage in mouse brain. Therefore the 
two strains are closely related. The nucleotide 
sequence of three regions of the WR genome which contain 
the selected neutral sites are shown in figures 19,20 
and 21. Four neutral sites (A-D) have been chosen on 
20 the above discussed criteria as follows: 
Site A: gap between SalF17R and SalF19R 
Site B: gap between SalFlQR and SalF20.5R 
Site C: in SalG2R, a potential non- functional gene 
Site D: in HindB3.5R, a potential non- functional gene 
25 These sites (A-D) can be identified by the following 
stretches of DNA sequence, each of which is 40 
nucleotides in length. 

A CTATCTACCAGATTATTATGTGTTATAAGGTACTTTTTCT 



813, 



1985) . 



For the purpose of prophylactic 
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B 



TATTGTGCTACTGATTCTTCACAGACTGAAGATTGTTGAA 



C 



TCTCTTAAAATGGTTGAGACCAAGCTTCGTTGTAGAAACA 



D 



TGAGGCTACCTCGACATACGTGTGCGCTATCAAAGTGGAA 



In other strains these sequences may vary, while still 
5 having substantial homology with those given above. In 
particular a site may have at least 90%, more preferably 
95%, homology with the sequence given above. 
Figures 3-5 show the distribution of initiation codons 
and open reading frames (ORFs) in the regions of the 
10 vaccinia virus genome shown in figures 19,20 and 21. 
Figures 6-8 show the same regions with a plot showing 
to what extent each reading frame conforms to the 
pattern of codon usage expected for vaccinia genes. A 
graph of codon usage is plotted for each of the three 
15 possible reading frames in each direction (Staden, R. , 
Nucl- Acids Res., 12, 521, 1984; Staden, R. , Nucl. Acids 
Res., 12, 551, 1984). In these codon usuage plots, the 
short vertical bar lines extending from the horizontal 
axes represent start codons. The longer vertical bar 
20 lines placed above the horizontal axes represent stop 
codons. This sort of plot is a useful way of helping to 
determine whether a particular ORF is a genuine vaccinia 
gene. Where there is a likely genuine gene, the graph 
of codon usuage will rise between a start codon and a 
25 stop codon. For example, in figure 7, it can be seen 
that the graph of codon usage rises over the region of 
the SalG2R ORF (the dotted line shows that this frame 
conforms most of the expected codon usage). For the 
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other two frames the graphs show that they do not 
conform to vaccinia codon usage. The peak labelled 
'part of gk\ and marked with a dashed line, also 
conforms well to vaccinia codon usage. In stimmary, a 
5 genuine gene must start with an Initiation (start) 
codon, end with a termination (stop) codon, and should 
conform well to vaccinia codon usage along its length. 
In most cases the conformation to the vaccinia codon 
usage drops off sharply outside the gene. 
10 The neutral sites are further described as follows 
Site A. Gap between SalF17R and SalF19R 
Site A is marked on figures 3 and 6. Figure 9 shows the 
actual DNA sequence with a translation of the ORFs on 
either side of the site. It can be seen that Site A is 
15 placed in an Intergenlc region between SalF17R and 
SalF19R. It is placed some 195 bases upstream of 
SalF19R to avoid any promoter elements associated with 
that gene. The sequence TTTTTCT (shown in italics) will 
act as a terminator of early RNA transcription for the 
20 SalF17R gene if it is an early gene. However, the site 
is placed downstream of the first of these, so it will 
not affect early termination of transcription if it 
occurs. Examination of figure 6 shows that there is no 
recognisable gene on the opposite strand at this point, 
25 and hence this sequence location is suitable as a 
neutral insertion site. 

Site B. Gap between SalF19R and SalF20.5R 

Site B is marked on figures 3 and 6. Figure 10 shows 
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the actual DNA sequence with a translation of the ORFs 
on either side of the site. It can be seen that Site 
B is placed in an intergenic region between SalF19R and 
SalF20o5R- Figure 6 shows that it is within a region 
5 of high vaccinia codon usage, but that this region does 
not form a genuine gene, having no initiation codon* 
In addition, figure 6 suggests that Sal20-5R is not a 
complete gene, as the conformation to vaccinia codon 
usage drops off dramatically at the start of the gene* 
10 In the event that SalF20-5R is a genuine gene. Site B 
is placed some 70 bases upstream of SalF20o5R which may 
well avoid any promoter elements associated with that 
gene. (Note: many vaccinia promoter elements are 
located in approximately 35 bases upstream of the start 
15 of the gene. ) In addition SalF20R has no TTTTTNT (N=any 
nucleotide) transcription termination signal with which 
Site B could interfere. Hence this sequence location 
is suitable as a neutral insertion site. 



Site C. Within SalG2R, a potential non- functional gene 
20 Site C is marked on figure 4. The ORF SalG2R has 
considerable similarity to the guanylate kinase (GK) 
gene of yeast. This similarity is shown in figure 11. 
Sequence upstream of the SalG2R ORF (but in a different 
frame) has been added on to SalG2R, to see if the match 
25 to GK extends beyond the boundaries of the original open 
reading frame. The match appears to extend beyond the 
5^ end of the SalG2R ORF. In particular, an important 
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site in the yeast GK gene, the ATP/GTP binding site 
(shown underlined) only matches in the out of frame 
sequence upstream of the SalG2R ORF* Hence, it is very 
likely that the SalG2R gene is not active as a guanylate 
5 kinase and can be referred to as a 'pseudogene' • If the 
gene is inactive as the applicants deduce, then it will 
serve as a neutral insertion site. 

Site Do Within HindB3,5R, a potential non- functional 
gene 

10 Figure 5 shows that site D lies within the region 
designated HindB3,5R. This region, although conforming 
to vaccinia codon usage, has no start codon and is 
therefore not a genuine gene. The codon usage plot 
shown in figure 8 indicates that it probably was once 

15 a functional gene, and may well have been attached to 
HindBSR (a shift in the codon usage preference occurs 
here well away from the termination codon of the HindB3R 
ORF which suggests that the last section of HindBSR is 
not properly part of this gene. ) Hence it is likely 

20 that HindBS . 5R is not active as a gene and can be used 
as a neutral insertion site. Figure 12 shows the actual 
DNA sequence with a translation of the ORFs on either 
side of the site. It can be seen that site D is placed 
in an intergenic region between HindBSR and HindB4R as 

25 well as being within the non- functional HindBS. 5R. 

Preparation of vector for cloning of neutral sites 

In order to insert foreign genetic information into the 
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neutral sites described above, DNA copies of the neutral 
sites, together with an appropriate amount of flanking 
DNA from the vaccinia genome (approximately 500 bases 
on either side) must first be cloned into a plasmid 
5 vector o These plasmids may then be used to introduce the 
foreign DNA into the vaccinia virus genome; the vaccinia 
virus ' flanking sequences ' around the inserted gene 
seirve to allow homologous recombination between the 
plasmid DNA and the viral DNA, with the consequent 
10 insertion of the foreign gene at the desired location. 



Cloning of neutral site sequences 

Plasmids containing flanking regions from the neutral 
sites are constructed as follows • DNA is prepared from 
the Wyeth strain of vaccinia virus by the method of 
15 Esposito et al., (J. Virol, Meth. 2: 175, 1981 )• The 
polymerase chain reaction ( PGR ) is used to remove an 
approximately 1000 base pair (bp) fragment from DNA of 
the Wyeth strain of vaccinia virus. Pairs of 
oligonucleotides are chosen approximately 500 bp either 
20 side of the chosen neutral site. These oligonucleotides 
are based on the sequence of the WR strain, but are 
chosen in regions where the sequence of the WR strain 
is identical to that of the Copenhagen strain (Goebel 
et alo. Virology 179:247, 1990). The oligonucleotides 
25 incorporate restriction enzyme recognition sequences so 
that they can be cloned easily into a plasmid. For 
neutral sites A,B and D the restriction sites are EcoRl 
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10 



15 



20 



and Hindlll. For neutral site C the Hindlll site is 
replaced by an Sphl site, since there is an internal 
Hindlll site in the chosen flanking sequences. 
The oligonucleotides used for PCR are listed below: 

Site A leftMB 16 

Site A rightMB 17 

Site B leftMB 24 

Site B rightMB 25 

Site C leftMB 18 

Site C rightMB 19 

Site D leftMB 22 

Site D rightMB 23 

DNA fragments of approximately Ikb are then prepared 
using these pairs of oligonucleotides by PCR 
amplification, digested with EcoRI and Hindlll (for site 
A,B and D) or with EcoRI and Sphl (for site C) and 
cloned into Hindlll and EcoRI -digested pUCllS (figure 
13) to generate the plasmids pIMMC7a, pIMMC7b, pIMMC7c 
and pIMMC7d. 

Creation of unique restriction sites for insertion at 
the neutral sites 

A suitable restriction enzyme site is then introduced 
at the selected location within each of the plasmids. 
This is achieved using site directed mutagenesis using 
an oligonucleotide containing the desired new unique 
site and flanked by 15 bases of sequence to either side 
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( see below ) . The plasmids modified in this fashion are 
deslgrnatred pIMMC8a-d ( figure 13 ) . 

original ollgonucleotrldesite new plasmid 
plasmid intiroduced 



pIMMC7a MB35SnaBl pIMMCSa 

pIMMC7b MB36Hpal pIMMCb 

pIMMC7c MB37Stul pIMMCSc 

pIMMC7d MB38SnaBl pIMMCSd 



10 Cloning of the vaccinia virus early/ lat:e promoter 
sequences 

The p7.5 and H6 promoters from vaccinia virus genomic 
DNA are prepared by PCR amplification as described 
below. 

15 A pair of complementary oligonucleotides (S7 and S8 ) is 
synthesised to include the following restriction enzyme 
sites, Hindlll, SnaBl, Hpal, Hindlll, Sail, Ncol, Smal, 
SnaBl and EcoRl, such that the pair, after annealing, 
present at one end Hindi II compatible overhanging ends, 

20 and at the other, EcoRl compatible overhanging ends. 
The two oligonucleotides are allowed to anneal and are 
inserted into pUCllB cut with EcoRl and Hindlll (figure 
14). The resulting vector is called pIMMC3. 

A DNA molecule of approximately 180 bp containing 
25 the H6 promoter is removed from the WR strain of 
vaccinia virus by PCR amplification using the 
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oligonucleotides MB15 (anneals upstrream and includes a 
5 '-Sail site) and MB7 (anneals downstream and includes 
a 5'-HindIII site). This is cloned into pIMMC3 cleaved 
with Hindlll and Sail to create pIMMC4a (figure 14). 
5 A DNA molecule of approximately 200bp containing the 
p7.5 promoter is then removed from the WR strain of 
vaccinia virus by PGR eunplif ication using the 
oligonucleotides MB32 (anneals upstream and includes a 
5 ' -Sail site) and MB33 (anneals downstream and includes 
10 a 5'Ncol site). This is cloned into pIMMC3 cleaved with 
Ncol and Sail to create pIMMC14b. 
Construction of the therapeutic virus 

The strategy required to generate a recombinant vaccinia 
virus containing and expressing the E6-E7 proteins from 
15 HPV16 and HPV18, based on the elements described above 
involves five main stages as outlined below, 
i ) Cloning of the modified E6-7 genes downstream of 
vaccinia early promoter sequences 

A DNA fragment containing the modified HPV16 E6-7 
20 sequence is excised from pIMS7.2 by digestion with 

Hindi II and Smal, and cloned into Hindi I I and 
Hpal-digested pIMMC4a to generate pIMS12 (figure 
15) . 

A DNA fragment containing the modified HPV18 E6-7 
25 sequence is excised from pIMS8.2 by digestion with 

Ncol and Smal, and cloned into Ncol and 
Smal-digested pIMMC14b to generate pIMS14 (figure 
15). 
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) Preparatrion of a plasmid vector con-tainlng both 
HPV16 and HPV18 E6-7 sequences toge-bher with their 
upstream vaccinia promoters . 

A DNA fragment containing the HPVIB E6-7 region 
together with the upstream p7 . 5 promoter is 
excised from pIMS14 with Sail and Smal and 
inserted into Sail and Smal-digested pIMS12 to 
generate pIMSlS (figure 15) 
iii) Insertion of the HPV E6- 7 /promoter "double" 
cartridge into the neutral site containing 
plasmids . 

A DNA fragment containing both the HPV16 and HPV18 
E6-7 coding regions together with their upstream 
promoter elements is excised from plMSlS with 
15 SnaBl and inserted into the appropriately-digested 

neutral site-containing plasmids pIMMC7a-d. This 
step is shown in figure 16, and the resulting 
plasmids are designated pIMMC9a-d. 
iv) Introduction via homologous recombination of the 
20 neutral site DNA, together with the intervening HPV 

sequences, into the vaccinia virus genome to create 
a recombinant virus expressing the two modified HPV 
E6-7 seguences . 

The recombinant plasmids pIMMC9a-d are purified 
25 and allowed to recombine into vaccinia (figure 17) 

using standard protocols (Mackett et al., in D.M. 
Glover ( ed ) DNA Cloning : a Practical Approach, 
Oxford and Washington DC, IRL Press, 1985), 
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Viruses which have acquired the HPV sequences are 
identified hy probing with radiolabelled HPV 
specific sequences. Viral plaques are lifted onto 
nitrocellulose (Villareal and Berg, Science 196, 
183, 1977) and probed with radiolabelled Nco I- Smal 
fragment from pIMS14 containing the HPV18 E67 
gene. Recombinant viruses are then isolated from 
the agarose overlay and plaque purified three 
times. They are checked for the presence of the 
appropriate DNA sequences by Southern blotting of 
purified virus DNA using DNA probes derived from 
the HPV E6 and E7 genes, and for expression of the 
appropriate sequences by western blotting using 
antisera specific for the HPV E5 and E7 proteins. 

Cloning of the therapeutic virus in MRC5 cells 
Stocks of the final recombinant virus are prepared 
by growth in Vero cells, and are used to infect 
MRC5 cells deemed suitable for the preparation of 
material suitable for use as human vaccines. The 
virus is plaque-purified three times by standard 
methods, and finally a stock prepared for clinical 
use. 

Confirmation of presence of the correct HPV DNA 
insert 

A sample of this stock virus is checked once again 
for the presence of correctly configured virus 
DNA, and for expression of the correct virus 
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proteins. Figure 22 shows the analysis by PGR, of 
a recombinani: vaccinia virus (v9a.l) in which the 
HPV DNA cassette is inserted at Site A . The 
diagram shown in panel (a) indicates the DNA 
5 fragments expected if insertion of the correct DNA 

has occurred. It can be seen in panel (b) that 
the actual pattern of PGR products generated is 
consistent with that expected. 



Confirmation of expression of the HPV DNA insert 
10 The recombinant viruses are then checked for 

expression of the expected HPV proteins. An 
example of this analysis is shown in Figure 23. 
Vero cells are infected with with recombinant 
virus v9a.l (HPV DNA inserted at Site A), and the 
1 5 cells examined by western blotting for the 

presence of the HPV E67 fusions proteins using 
monoclonal antibodies specific for the HP VI 6 E7 
protein (camvirS ) and for the HPV18 E7 protein 
( 7E10 ) • It can be seen that both monoclonal 
20 antibodies recognise specifically proteins of the 

expected size in cells infected with the 
recombinant virus vQa.l, but not in cells infected 
with the control parent virus Wyeth strain. These 
recognised proteins co-migrate with proteins 
25 synthesised by in vitro translation of mRNA 

encoding the expected HPV fusion proteins (HPV 16 
E67 and HPV18 E67). This experiment indicates 
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successful expression of the heterologous gene 
sequences from the recombinant virus • 
Stability of the HPV DNA insert 

For the recombinant virus to be of use clinically^ 
it is important that the inserted sequences remain 
genetically stable over multiple virus passage, 
and the DNA insert was carefully designed to 
promote this genetic instability. To confirm the 
stability of the HPV information within the 
recombinant virus genome, the virus is subjected 
to 9 serial passages, (multiplicity of infection 
=10pfu/cell) in Vero cells. Subsequently 20 
plague isolates are picked, and analysed for the 
presence of the correct HPV DNA insert by PCR 
analysis as described in Figure 22. The data 
obtained for recombinant virus A are shown in. 
Figure 24. All 20 virus isolates retained the HPV 
information in the expected genetic arrangement 
indicating a considerable degree of genetic 
stability. 

Animal experiments 

The virulence of the recombinant virus is compared 
in animal experiments with that of the parental 
Wyeth strain. Groups of 20 mice are inoculated 
intranasally each with 10^ pfu of Wyeth strain or 
recombinant virus in a total volume of 20\il. Two 
mice are sacrificed at 1 day, 3 days and 5days 
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following inoculation, and the lungs dissected 
out* Tile amount of virus present in the lungs is 
then measured by grinding the tissue, and assay of 
the homogenate by standard vaccinia virus plaque 
5 assay • The results of such an experiment for the 

recombinant virus v9a.l (HPV infromation inserted 
at site A) are shown in Figure 25. It can be seen 
that the recombinant virus retains the ability to 
replicate in mice , and that the level of virus 
10 produced in the lungs of the infected animals is 

similar to that seen with the parental Wyeth 
strain. 



Therapeutic Use 

A stock of the recombinant virus is prepared by 
15 infection of MRC5 cells, and adjusted to a 

concentration of not less than 10® pfu/ml. 20 ^l 
of this material is applied to the arm of the 
patient, which is then scarified through the virus 
droplet with a bifurcated needle, according to the 
20 standard procedure used for vaccination against 

smallpox . 
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CLAIMS 



1. A recombinant virus vector for use as an 
immunotherapeutic or vaccine which comprises at least 
one pair of nucleotide sequences heterologous to said 
5 virus which have sufficient sequence homology that 
recombination between them might be expected; 

wherein said pair of nucleotide sequences are 
arranged in said virus vector such that they are 
inverted with respect to each other to reduce the 
10 likelihood of recombination events leading to loss of 
part or all of said sequences and said virus vector is 
able to infect a mammalian host cell and express as 
polypeptide the heterologous nucleotide sequences in 
said host cell* 

15 2. A recombinant virus vector according to claim 1 
wherein at least one pair of nucleotide sequences 
encode part or all of human papillomavirus (HPV) wild- 
type proteins or mutant proteins immunologically 
cross -reactive with said wild-type proteins. 

20 3* A recombinant virus vector according to claim 1 
or claim 2 wherein the pair of nucleotide sequences 
encode part or all of the HPV wild-type proteins 
HPV16E7 and HPV18E7 or mutant proteins immunologically 
cross-reactive therewith. 
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4. A recombinant virus vector according to claim 1 
or claim 2 wherein the pair of nucleotide sequences 
encode part or all of the HPV wild-type proteins 
HPV16E6 and HPV18E6 or mutant proteins immunologically 
cross -reactive therewith. 



5* A recombinant virus vector according to claim 3 
which comprises a further pair of nucleotide sequences 
which encode part or all of the HPV wild-type proteins 
HPV16E6 and HPV18E6 or mutant proteins immunologically 
10 cross-reactive therewith. 



6. A recombinant virus vector according to any one 
of the preceding claims wherein two or more nucleotide 
sequences of different said pairs may be fused 
together to form a single open reading frame. 

7. A recombinant virus vector according to claim 6 
wherein the fusions are via a single codon encoding a 
small neutral amino acid. 



8. A recombinant virus vector according to claim 7 
wherein the amino acid is glycine. 

20 9. A recombinant virus vector according to claim 5 
in which the pairs of nucleotide sequences are 
arranged in the virus vector according to any one of 
the options as shown in Figure 26. 
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10. A recombinant: virus vector according to claim 9 
which comprises 

a first open reading frame having a fused genetic 
sequence encoding part or all of the wild-type 
proteins E6 and E7 from HPV16; and 

a separate second open reading frame having a 
fused genetic sequence encoding part or all of the 
wild- type proteins E6 and E7 from HPV18; 

wherein the first and second open reading frames 
may be inverted with respect to one another whereby 
either: 

i) the E6 coding sequences of HPV16 and HPV18 
are both located between the E7 coding sequences of 
HPV16 and HPV18; or 

ii) the E7 coding sequences of HPV16 and HPV18 
are both located between the E6 coding sequences of 
HPV16 and HPV18; and 

wherein any of said wild-type proteins may be 
replaced by a mutant protein immunologically cross - 
reactive therewith. 

11. A recombinant virus vector according to claim 10 
wherein each of the first and second open reading 
frames has a corresponding promoter and the two open 
reading frames each with its promoter, are arranged 
next to each other in the virus. 

12. A recombinant virus vector according to claim 11 
wherein either: 
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i) the promoters are located between the first 
and second open reading frames whereby the open 
reading frames are transcribed outwardly; or 

ii) the promoters are located outside the first 
5 and second open reading frames whereby the open 

reading frames are transcribed inwardly. 

13 . A recombinant virus vector according to any one 
of claims 9 to 12 which comprises 

a first open reading frame having a fused genetic 
10 sequence encoding part or all of the wild-type 
proteins E6 and E7 from HPV16; and 

a separate second open reading frame having a 
fused genetic sequence encoding part or all of the 
wild-type proteins E6 and E7 from HPV18; 
15 wherein the E6 coding sequences of HPV16 and 

HPV18 are both located between the E7 coding sequences 
of HPV16 and HPV18; and 

each open reading frame has a corresponding 
promoter, the promoters being located between the 
20 first and second open reading frames whereby the open 
reading frames are transcribed outwardly; and 

wherein any of said wild-type proteins may be 
replaced by a mutant protein immunologically cross- 
reactive therewith. 

25 14 . A recombinant virus vector according to any one 
of the preceding claims wherein either or both of the 
nucleotide sequences in a said pair of nucleotide 
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sequences are alt:ered to make them less homologous 
than an equivalent pair of nucleotide sequences 
encoding wild-type HPV proteins. 

15 • A recombinant virus vector according to claim 14 
5 wherein the alteration in nucleotide sequence does not 
result in an alteration of the encoded amino acid 
sequence • 

16. A recombinant virus vector according to claim 3 
wherein the wild-type proteins HPV16E7 and HPV18E7 are 

10 replaced with mutant proteins which are substantially 
homologous to said wild-type proteins and in which the , 
residues cys 24 and glu 26 of wild-type protein 
HPV16E7 and the residues cys 27 and glu 29 of wild- 
type protein HPV18E7 are replaced with glycine ^ 

15 residues. 

17. A recombinant virus vector according to any one 
of the preceding claims wherein said heterologous 
nucleotide sequences may comprise part or all of the 
sequences shown in Figures 1(a) and 1(b). 

20 18. A recombinant virus vector according to any one 
of the preceding claims which is derivable from 
vaccinia virus. 

19. A recombinant virus vector according to any one 
of the preceding claims wherein the nucleotide 
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sequences are inser-ted into the virus vector at; one or 
more neutral sites, the disruption of which by the 
insertion of the nucleotide sequences does not 
substantially adversely affect viral functions 
5 relating to the replicative ability of the virus in 
the mammalian cell. 

20. A recombinant virus vector according to claim 19 
which is derivable from vaccinia virus and wherein the 
neutral sites may be one or more of: 
10 A) the gap between SalIF17R and SallFlQR of 

strain WR comprising at least part of the sequence 
CTATCTACCAGATTATTATGTGTTATAAGGTACTTTTTCT; 

B) the gap between SalIF19R and SalIF20,5R of 
strain WR comprising at least part of the sequence 

1 5 TATTGTGCTACTGATTCTTCACAGACTGAAGATTGTTGAA ; 

C) a region in SalIG2R of strain WR comprising 
at least part of the sequence 

TCTCTTAAAATGGTTGAGACCAAGCTTCGTTGTAGAAACA; 



20 at least part of the sequence 

TGAGGCTACCTCGACATACGTGTGCGCTATCAAAGTGGAA; 

E) a sequence having at least 90% sequence 
homology to those sequences A) to D) identified above. 

21. A method for making a recombinant virus vector 
25 according to claim 19 or claim 20 which comprises 
inserting a said heterologous nucleotide sequences into 
one or more neutral sites in a virus vector, the 



D) 



a region in HindB3.5R of strain WR comprising 
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disruption of such a site by said insertion will not 
substantially adversely affect the replicative ability 
of the virus and wherein the neutral site has been 
previously identified by: (a) analysing a viral genome 
5 to identify open reading frames which are likely to 
encode functional genes; and (b) selecting sites between 
open reading frames for functional genes or sites within 
sequences for non- functional genes. 

22, A recombinant virus vector obtainable by the method 
10 of claim 21. 

23. A method which comprises using a recombinant virus 
vector according to any one of claims 1 to 20 or to 
claim 22 to manufacture a medicament for use as an 
immunotherapeutic or vaccine against a condition thought 

15 to be caused by HPV infection, for example cervical 
cancer , 



24. A method which comprises using a recombinant virus 
vector according to any one of claims 1 to 20 or to 
claim 22 to specifically activate cells of the immune 
20 system to HPV proteins. 
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Sa I F I 7R 

GflTTGTGTGCfiTGCGTGGTflGflTGTTTGGflGflflflTGflGflflflCTGTTTTCTftCflTGGflHflT 

LC flCUUDUU RNEKLFS fiUtCV 

flTTGTTTflCGflGCTflTTflflflCTGTTTflTTflflTGflTCflCflTGCTTGflTflflGflTflflflflTCTfl 

CLRfllKLFIHOHniDKl KSI 

TflCTGCflGflflTflGflCTflGTflTflTGTGGflflflTGTCflTflGflflflGTTflflflflGTTflflTGflGHGC 
LQHRLUVUEnS" 

flflflflflTflTflTflflGGTTGTflTTCCflTflTTTGTTflT TTrrrC rGTflflTflGTTflGfiflflflflTflC 



flTTCGflTGGTCTflTCTflCCnCRTTfiTIflllLIiTTflTflflGGTflC rrrrrcrCflTflflTflflflC 
TflGflGTflTGfiGTfiflGflTRGTGTTTTTCflflflflCflTflTflflflTCTflflflflTTGflTGGflTGflGflT 

flTflCflGCTflTTflRTTTCGflflRRTflTflTTTTflflTCTGRTflflCTTTflRflCflTGGflTTTTTGR 

SalF]9R 

TGGTGGTTTflflCGTTTTflflflflflflflGRTTTTGTTflTTGTflGTnTflTGflTflfiTflTTflflflflGfl 

n I I L K D 

TGGflTRTflflflGflflTTTGCTGflCTCCflTGTflCTRTTTTTTRCflTTflCTflCfiTTGGCTflCGG 

GVKEFflOCnVVFLHVVlGVG 
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Sa I F20R 

TGCCflflGGTTflGflTGTflflTGGTflflCGflTflflCflCflflflflTGTGflflCGCTGCCCncCTCflTflC 
^*^^fCNGNDNTKCERCPPHT 

flTflTflCCflCfiflTCCCflflTTflTTCTflflTGGflTGTCflTCflflTGTflGflflflflTGCCCflfiCCGGfl ,200 
^ T T I P I I L n D U t H U £ M fl Q P 0 

TCflTTTGflTflflGGTflflflGTGTflCCGGflflCflCflGflflCflGTfiflflTGTTCGTGTCTTCCTGGT 1 7fin 

H L I R " I ^ ou 



Site B 

TGGTflTTGTGCTflCTGflTT£Il£l£.flGflCTGflflGflTTGTTGflflflTTGTGTflCCflflflflflGG 



CGflTCTTGTTRTflflTTflTflflTflTflRflflflTflCTTfiTGGCflTflGTflflCTCflTflflTTGCTGflC 



sonsTmruiTis sheet 
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1320 



RGflTGTCCflTGCGGflTflCTTTGGTGGflflTflGflTGflflCflflG'GflflflTCCTflTTTGTflfiflTCG 1330 

"NKEILFUHR 

TGTTGTGTTGGTGflflTflTTGCGflCTflCCTflCGTflflTTRTflGflCTTGflTCCflTTTCCTCCfl i-,-,o 
^ " <- U N I fl T T V U I I 0 L I H F L H 



1500 



TGCnflflCTflTCTflflflTGTflflTTflflTTRTGflTTTTGflTGflTflRTGTTflCCflTflCfiTTflTflT 
''"''1-NUINVOFOOHUTlHVl 

CGCTflCTTGGTTRGTGTflTTflTTCfl^GTflTGflflGflCCTflTTflflTflflTTflCTTflTCTTTTGfl ,560 
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Gap Ueight. 3.000 Ruerage Holch. O.S^O 

Length Ueight. 0.100 Rueroge nismotch. -0.396 

Quolity. 113.0 Length. 199 

Rotio. 0.608 Gaps. 4 

Percent Similarity. 56.354 Percent Identity-. 29.282 

The top sequence \z SalG2R and the bottos sequence is the yeost GtC 



Start of G2R 

u 

I nSGIUKSI lLS££££LJLnfllflk:RLnGIV. . .LDLUCPIPLDFLULHERE .4 7 



1 SRPIUiS GPSGTGKS TLLKgLFREVPDSFGFSUSSTTRTPflflGgUH 16 

48 GUDVHVUHREfllUKGIflflGHFLEHTEFLCHIVGTSKTflUHTflfllMHRICU 97 

4 7 GKDVHFUSUDEFtCSniKHHEF lEUROFSGMVVGSTUflSUKOUSKSGKTC I 96 

98 nDLHIDGURSLKH.TVLtlPVSUVIRPTSLKnUETKLRCflHTERHDEIHRR 1 46 

97 LDIOnQGUKSUJCfllPELHRRFLFIflPPSUEDLKKRLEGRGTETEESI HKR 1 46 

147 UlLRKTOnOERHEflGLFOTI I I EDOUMLflVSKL I 0 . I LQOR I RnVFNTN 194 

147 LSRROflELflVfl . E TGRHDKU I UHOOLDkCflVKELKDF I FflE>: 1 86 
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HindB3R 

CTAAGAACACGTATACGGCAGCAGCTTCCTTTATACTCTCATCTTTTACCAACACAAAGG ^ay 
LRTRIRQQLPLYSHLLPTQR 

GTGGATATTTGTTCATTGGAGTTGATAATAATACACACAAAGTAATTGGATTCACGGTGG 
VDICSLELIIIHTK* 

GTCATGACTACCTCAGACTGGTAGAGAATGATATAGAAAAGCATATCAAAAGACTTCGTG 



TTGTGCATTTCTGTGAGAAGAAAGA6GACATCAAGTACACGTGTCGATTCATCA 



567 
627 



AGGTAT 687 



Site D 

ATAAACCTGGGGATGAGGCTACCTCGACAIA£fiI£TGCGCTATCAAAGTGGAAAGATGCT 

GTTGTGCTGTGTTTGCAGATTGGCCAGAATCATGGTATATGGATACTAATGGTATCAAGA 

AGTATTCTCCAGATGAATGGGTGTCACATATAAAATTTTAATTAATGTAATAGAGAACAA 

ATAATAAGGTTGTAATATCATATAGACAATAACTAACAATTAATTAGTAACTGTTATCTC 

' I n I lAACTAACCAACTAACTATATACCTATTAATACATCGTAATTATACTTCTTAACA 

TCTATTAATCATTAATTCGCTTCTTTAATTTrrr^rAAACTAACATrGTTAATTGAAAAG 
rr.^. HindB4R 

GGATAACATGTTACAGAATATAAAmTATATGGATTTTTTTAAAAAGGAAATACTTGAC 

M D F F K K E I L D 

TGGAGTATATATTTATCTCTTCAmTATAGCACGCGTGTTTTCCAATTTTTCCACATCC 1167 
""SIYLSLHYIARVFSNFSTS 
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Synthetic oligo 
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Cut With Hindlll and EcoRI 



pH6 promoter 



PGR product digested 
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SnaBI Hpal Hindlll Sail Ncol Smal SnaBI 
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p7.5 promoter 



PGR product digested 
with Sail ana Ncol 



SnaBI^Hp^indiil Sail Ncol Smal SnaBI 
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Hindm^^^NcoI 



^["^^SnaBI 




HPV16 E67 



Hindm/Smal 
fragment from 

plMS7.2 



Digest with 
Hindlll/Hpal 



Digest with 
Ncol/Smal 



^^Ncol ^^^^ 
SnaBI HPV16E67'^|"/!23lSnaBI SnaBll^P^ 



HPV18 E67 
V ////////A 

Ncol /Smal 
fragment from 
PIMS8.2 



plMS12 
ampr 
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HPV18 E67 
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plMS13 
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Digest with 
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Digest with 
Sail and Smal 



Sail 



SnaBI HPV16 E^^PI^.'^'lsnaBI 



plMS12 
ampr 

_l«.:.:U:^!i!|tV^sW!-!-l 



jdmiSmal,- 



Sail 

Ncol Smal 
. IhPVIB E67 

wz 



///////A 



' Sail 

^ Hindin Ncol 

SnaBI HPV16 E67 



Smal 



Fig. 15. 



31 HPV16E67 I IHPV18E67I . 



,SnaBI 



plMS15 
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HPV16 E6-7 HPV 18 ES-T 



pIMSIS 
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Cut with SnaBI 
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plMMB9 a-d 
HPV16E67 HPV18E67 



Recombination 
plasmid 



L _HPV16E67 , HPV18E67 R' I 



homologous 
recombination 





Virus genome 

Si'Si%VAV%V»S*J— - — — - — — - — — - containing neutral 
L R site 



Recombination between 
plasmid and virus genome. 
Recombinant selected by probirig 
with HPV-spejfic sequences. 
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TCAGGATCCCACATGAGCGAAAAATACATCG 

TCAAAGCTTATTACGATACAAACTTAACGGA 

TCAGTTAACATAAAAAGAACAACGCCCGGCAG 

TCAAGGGCCTCTATATAGTAATACCAATACTC 

TCAGTCGACTTACAAACAACTAGGAAATTGG 

TCAGAATTCTATGTACAGAGGTCTATTAGGC 

TCAAAGCTTGTATGAGGTGGGCAGCGTTCAC 

TCAGAATTCTTAATTATATTGTCGGCCGTGG 
TCAGCATGCATGATCCGTTAGCTTTGGGCTC 
TCAGAATTCGAAGCTCTAGAGTATCTTAGCG 
TCAAAGCTTTCCTGTATTATATGGGATGTGG 
TCAGAATTCATTGATGGATGAGATATACAGC 
TCAAAGCTTTCACAAAATCG 

TCAGAATTCCACGTATACGGCAGCAGCTTCC 

TCAAAGCTTTGTTCTACGTCCATTTTCAAGC 

TCAGTCGACATACCAATACTCAAGACTACGA 

TCACCATGGATTGCTATTGATTGAGTACTGTTC 

AGTACCTTATAATACGTAATAATCTGGTAG 

AATCTTCAGTCTGTTAACAATCAGTAGCAC 

TACAACGAAGCTAGGCCTCAACCATTTTAA 

TTTGATAGCGCATACGTATGTCGAGGTAGC 

AATAGGTATATAGTTAACTGGTTAGTTAAA 

ATCCCATGGCGCGCTTTGAGGATCCAAC 

TCACCCGGGTTACTGCTGGGATGCACACCAC 

ATCCCATGGACCAAAAGAGAACTGCAATGTTTC 

TCACCCGGGTTATGGTTTCTGAGAACAGATG 

AGCTCTACGTAGTTAACAAGCTTGTCGACCCATGGCCCGGGTACGTA 

AATTTACGTACCCGGGCCATGGGTCGACAAGCTTGTTAACTACGTAG 
GAAACCCAGCTGGGAATCATGCATGG 

GAAACACAAGTAGGAATATTAAGTATG 

GATCTCTACGGTTATGGGCAATTAAATGAC 

GACCTTCTAGGTCACGGGCAATTAAGCGAC 

ATGTATAGATTTCTACAGTAGAATCAGAGAATTAAG 
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TATTGTTTACGAGCTATTAAACTGTTTArTAATGATCACATGCTTGATAAGATAAAATCT 6 0 

YCLRAIKLFINDHMLDKIKS 
IVY£LLNCLLMITCL IR*NL 
LFTS Y*TVY**SHA**DKIY 

ATACTGCAGAATAGACTAGTATATGTGGAAATGTCATAGAAAGTTAAAAGTTAATGAGAG ^ 20 

ILQNRLVyVEMS*KV KS**E 
Y C R I D * YMWKC HRKL KVN E S 
TA£*TSICGNVI£S*KLMRA 

CAAAAATATATAAGGTTGTATTCCATATrTGTTATTITTrCTGT^^ 180 
QKYIRLYSIFVIFSVIVRKI 
KNI*GCIPYLLFFL**LEKY 
KIYKVV'FHICYFFCNS »k'nT 

CATTCGATGGTCTATCTACCAGATTATTATGTGTTATAAGGTACTTTTT 24 0 

HSMVYLPDYYVL'GTFSHNK 
IRWSIYQI-IMCYKVL-FLIIN 
FDGLSTRLLCVIRYFFS**T 

CTAGAGTATGAGTAAGATAGTGTTTTTCAAAACATATAAATCTAAAATTGATGG^ 300 
I'EYE*DSVFQNI*I*N*WMR 
* SMSKIVFFKTYKSKIDG*D 
RV*VR*CFSKHINLKLMDE I 

TATACAGCTATTAATTTCGAAAATATATTTTAATCTGATAACTTTAAAC^^ 360 
YTAINFENIF*SDNFKHGF L 
IQLLISKIYFNLITLNMDF* 
YSY*FRKYILI **L*TWIFD 

ATGGTGGTTTAACGTTTTAAAAAAAGATTTTGTTATTGTAGTATATGATAATAT^ 4 2 0 

MVV»RFKKRFCYCSI* 
WW F NVLKKD FVI VVY D N*' KR 
GGLTF-KKILLL*YMI I^LKD 

ATGGATATAAAGAATTTGCTGACTGCATGTACTATTTTTTACA^^ 480 
MDIKNLLTACTIFYITTLAT 
^ * I C * L H V L F F T L L H W 1 R 

GYKEFADCM YYFLHYYIGYG 

GCAGATATACCTACTCCGCCACCAACGGGTCATGTGACAAGGGAGAATATCXTGATAAGA 540 
ADIPTP PPTGHVTRENILIR 

qiyllrhqrvm'qgris**" 
rytysatngscd kgeyldk^'r 

GGCATAATCAATGTTGTAATCGGTGTCCJ^CCTGGAGAATTTGCCAAGGTTAGATGTAATG 600 

giinvvigvhlenlprldvm 

A*SML*SVSTWRICQG' M*W 
HNQCCNRCPPGEFAKVRCNG 

GTAACGATAACACAAAATGTGAACGCTGCCCACCTCATACATATACCACAATCCCAATTA 660 
VTITQNVNAAHLIHIPQSQL 
• R* HKM»TLPTSYIYHNPNY 
NDNTKCERCPPHTYTTIPII 
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Fig. 19(Cont). 

TTCTAATGGATGTCATCAATGTAGAAAATGCCCAACCGGATCArTTGATAAGGTAAAGTG 
F*WMSSM*KMPNRII - "GKV 

SNGCHQCRKCPTGSFDKVKC 
LMDVINVENAQPDHLIR»SV 

TACCGGAACACAGAACAGTAAATGTTCGTGTCrrCCTGGTTGGTATTGTGCTACTGATTC 7 R n 

YRNTEQ*MFVSSWLVLCY-F 
^GTQNSKCSCLPGWYCATDS 
PEHRTVNVRVFLVGIVLLIL 

TTCACAGACTGAAGATTGTTGAAATTGTGTACCAAAAAGGAGATGTCCATGCGGATACTT 8 4 n 

SQTEDC*NCVPKRRCPCGYF 
HRLKIVEIVYQKGD VHADTL 

TGGTGGAATAGATGAACAAGGAAATCCTATTTGTAAATCGTGTTGTGTTGGTGAATATTG 
W WNR-TRKSYL 'IVLCW-IL 
G GIDEQGNPICKSCCVGEYC 
VE*MNKEILFVNRVVLVNIA 

CGACTACCTACGTAATTATAGACTTGATCCATTTCCTCCATGCAAACTATCTAAA 

TTYVIIDLIHFLHANYLNVI 

TTAATTATCyVTTTp^TGATAATGTTACCATACArr^^ ^020 
I'IMILMIMLPYIISLLG«CI 
'^'*'*CYHTLYRYLVSV- 



900 



960 



1080 



1140 



ATTCAGTATGAAGACCTATTAATAATTACTTATCTTTTGACGATCTTGTTATA^^ 
I QYEDLLIITYLLTILL-L" 

S V • R p I N N Y L S F D D L V 'l ^ ^ ^ 

TATAAAAATACTTATGGCATAGTAACTCATAATTGCTGACGCGATAAATTCGTAATAATC 
% v . ^°^^^"^^*RDKFVI I 
IKILMA-»LIIADAINS* * S 
'KYLWHSNS-LLTR*iRNN L 

^^^^,7^r™"^^^^TCTACAGGCATAAAAATAAAAATATAATTTATAA 1200 
C FVQj.FIRNLQA*K«KYNT . 

^ ^ ^ ' ^ ^ - ^ f-- N K N I I "y N 

FCSNFYKESTGIKIKI ' F I X 

TATACTCTTACAGCGCGCCATCATGAATAACAGCAGTGAATTGATTGCTG 1 9*;n 

YTLTARHHE'QQ*iDC 
ILLQRAIMNNSSELIA 
YSYSAPS'ITAVN*LL 
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ATATTTGGTATTACCGCATTAATTATATTGTCGGCCGTGGa\ATTTTCTGTAT^ 6 0 

IFGITALIILSAVAIFCITY 
YLVLPH*LYCRPWQFSVLHI 
IWYYRINYIVGRGNFLYYIL 

TATATATATAATAAACGTTCACGTAAATACAAAACAGAGAACAAAGTCTAGATTTTTm^ 120 
YIYNKRSRKYKTENKV*IFD 
lY I INVHVNTKQRTKS RFLT 
YI**TFT*IQNREQSLDF*L 

TTACATAAATGTCTGGGATAGTAAAATCTATCATATTGAGCGGACCATCTGGTTTAGGAA 180 
LHKCLG**NLSY*ADHLV*E 
YINVWDSKIYHIERTIWFRK 
T*MSGIVKSIILSGPSGLGK 

AGACAGCCATAGCCAAAAGACTATGGGAATATATTTGGATTTGTGGTGTCCCATACC^ 240 
RQP * PKD YGNI F GFVVS HTT 
DSHSQKTMGIYLDLWCPIPL 
TAIAKRLWEYIWICGVPYH* 

AGATTTCCTCGTCCTATGGAACGAGAAGGTGTTGATTACCATTACGTTAACAGAGAGGCC 300 
RFPRPMEREGVDYHYVNREA 
DFLVLWNEKVLITITLTERP 
ISSSYGTRRC*LPLR*QRGH 

ATCTGGAAGGGAATAGCCGCCGGAAACTTTCTAGAACATACTGAGTTTTTAGGAAATA 360 
IWKGIAAGNFLEHTEFLGNI 
SGRE*PPETF*NILSF*EIF 
LEGNSRRKLSRTY*VFRKYL 

TACGGAACTTCTAAAACAGCTGTGAATACAGCGGCTATTAATAATCGTATTTGTGTGATG 420 
YGTSKTAVNTAAINNRICVM 
TELLKQL* IQRLLIIVFV'W 
RNF*NSCEYSGY**SYLCDG 

GATCTAAACATCGACGGTGTTAGAAGTCTTAAAAATACGTACCTAATGCCTTACTCGGTG 480 
DLNIDGVRSLKNTYLMPYSV 
I*TSTVLEVLKIRT*CLTRC 
SKHRRC*KS*KYVPNALLGV 

TATATAAGACCTACCTCTCTTAAAATGGTTGAGACCAAGCTTCGTTGTAGAAACACTGAA 540 
YIRPTSLKMVETKLRCRNTZ 
I* DLPLLKWLRPSFVVETLK 
YKTYLS^NG-DQASL'KH'S 
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Fig.20(Cont). 



600 



GCTAACGATGAGATTCATCGTCGCGTGATATTGGCAAAAACGGATATGGATGAGGCCAAC 
"NDEIHRRVILAKTDMDEAN 
L T M R F I V A ' Y W Q K R I M R P - 

'R*DSSSRDIGKNGYG*GQ*R 

GAAGCAGGTCTATTCGACACTATTATCATTGAAGATGATGTGAATTTAGCATATAGTAAG 660 
£-AGLFDTIirEDDVNTAY9t/ 
KQVYSTLLSLKMM - l"- H I V S 
S R S I R H Y Y H - R . c E F S I - - v 

TTAATTCAGATACTACAGGACCGTATTAGAATGTATTTTAACACTAATTAGAGACTTAAG 720 
L I QILQDRIRMYFNTN-RLK 
M o ^ ^ ^ ^ ^ ^ ^ ^ I L ' L I H D L R 

NSDTTGPY-NVF'H'LET- D 

ACTTAAAACTTGATAATTAATAATATAACTCGTTTTTATATGTGGCTATTTCAACGTC^^ 7g0 
T v^^^*^^^^^^^^^"WLFQRL 
T v^ r^'^^^'^^^^CGYFNV. 
LKLDN«*YNSFLYVAISTSN 

ATGTATTAGTTAAATATTAAAACTrACCACGTAAAACTTAAAATTTAAAATGATATTTCA 
MY*LNIKTYHVKLKI*NDIS 
C I S-ILKLTT'NLKFKMIFH 
VLVKY-NLPRKT«NLK*YFI 

TTGACAGATAGATCACACaTTATGAACTTTCAAGGACTTGTGTTAACTGAC^ 
L T DRSHIMNFQGLVLTDN C K 
*QIDHTL*TFKDLC*LTIA K 
°R*ITHYELSRTCVN-QLQ K 



840 



900 



960 



AATCAATGGGTCGTTGGACCATTAATAGGAAAAGGTGGATTCGGTAGTATTTATACTACT 

NQWVVGPLIGKGGFGST v ^ 

INGSLDH * *EKVDSVv'f*^*L% 
SMGRWTI NRKRWIR* YL^Y Y " 

AATGACAATAATTATGTAGTAAAAATAGAGCCCAAAGCTA nnon 
NDNNYVVKIEPKA 
MTIIM**K*SPKL 
"■Q'LCSKNRAQSS 
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ACCATCGAGGTPJ^CCACCrCTCTGGPJ^GACAGCGTGPiATAATGTAC^ 
TIEVTTSLED.SVNNVLMKRL 
PSR*-PPLWKTA*IMYS*NVW 
HRGNHLSGRQRE»CTHETFG 

GAAACTATACGCCATATGTGGTCTGTCGTATATGATCATTTTGATATTGTGAATGGTAAA 
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